Rule Induction Using a Reverse Polish Representation

نویسندگان

  • G. F. Davenport
  • M. D. Ryan
  • Victor J. Rayward-Smith
چکیده

It is often necessary to extract simple and understandable rules from databases containing inconsistent records and/or irrelevant elds. In this study we have assessed the feasibility of using a genetic programming (GP) approach to extract a single rule to describe such data. Instead of a tree structure we use a Reverse Polish (postx) representation. To assess the performance of the GP algorithm, it is compared to a steepest ascent hill climber algorithm and C5.0, a commercially available data mining algorithm (Quinlan, 1997). On the datasets used, the GP algorithm out-performs both C5.0 and a steepest ascent hill climber in the simplicity and, in most cases, the accuracy of the expressions produced.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Flexible, Adaptable and Scaleable Business Rules Management System for Data Validation

The policies governing the business of any organization are well reflected in her business rules. The business rules are implemented by data validation techniques, coded during the software development process. Any change in business policies results in change in the code written for data validation used to enforce the business policies. Implementing the change in business rules without changin...

متن کامل

A Novel Rough Net Approach for Rules Representation and Verification

Abstract: Rule based systems store knowledge as a set of rules and reason over them to solve problems. A new hype rough net approach for rule generation, representation and reasoning of knowledge-based system using rough sets and Petri nets is presented in this paper. A Rough Petri Net (RPN) is proposed for representing knowledge and formalism for the verification of rule-based systems. The mai...

متن کامل

A Many Threaded CUDA Interpreter for Genetic Programming

A Single Instruction Multiple Thread CUDA interpreter provides SIMD like parallel evaluation of the whole GP population of 1 4 million reverse polish notation (RPN) expressions on graphics cards and nVidia Tesla. Using sub-machine code tree GP a sustain peak performance of 665 billion GP operations per second (10,000 speed up) and an average of 22 peta GP ops per day is reported for a single GP...

متن کامل

Application of rule-based models for seismic hazard prediction in coal mines

The paper presents results of application of a machine learning method, namely the induction of classification and regression rules, for seismic hazard prediction in coal mines. The main aim of this research was to verify if machine learning methods would be able to predict seismic hazard more accurately than methods routinely used in Polish coal mines on the basis of data gathered by monitorin...

متن کامل

Strong Induction Principles in the Locally Nameless Representation of Binders (Preliminary Notes)

When using the locally nameless representation for binders, proofs by rule induction over an inductively defined relation traditionally involve a weak and strong version of this relation, and a proof that both versions derive the same judgements. In these notes we demonstrate with examples that it is often sufficient to define just the weak version, using the infrastructure provided by the nomi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999